IS

Chiang, Roger H.L.

Topic Weight Topic Terms
0.351 office document documents retrieval automation word concept clustering text based automated created individual functions major
0.101 personalization content personalized willingness web pay online likelihood information consumers cues customers consumer services elaboration

Focal Researcher     Coauthors of Focal Researcher (1st degree)     Coauthors of Coauthors (2nd degree)

Note: click on a node to go to a researcher's profile page. Drag a node to reallocate. Number on the edge is the number of co-authorships.

Wei, Chih-Ping 1 Wu, Chia-Chen 1
hierarchical agglomerative clustering (HAC) 1 personalized document clustering 1 supervised document clustering 1

Articles (1)

Accommodating Individual Preferences in the Categorization of Documents: A Personalized Clustering Approach. (Journal of Management Information Systems, 2006)
Authors: Abstract:
    As electronic commerce and knowledge economy environments proliferate, both individuals and organizations increasingly generate and consume large amounts of online information, typically available as textual documents. To manage this ever-increasing volume of documents, individuals and organizations frequently organize their documents into categories that facilitate document management and subsequent access and browsing. Document clustering is an intentional act that should reflect individual preferences with regard to the semantic coherency and relevant categorization of documents. Hence, effective document clustering must consider individual preferences and needs to support personalization in document categorization. In this paper, we present an automatic document-clustering approach that incorporates an individual's partial clustering as preferential information. Combining two document representation methods, feature refinement and feature weighting, with two clustering methods, precluster-based hierarchical agglomerative clustering (HAC) and atomic-based HAC, we establish four personalized document-clustering techniques. Using a traditional content-based document-clustering technique as a performance benchmark, we find that the proposed personalized document-clustering techniques improve clustering effectiveness, as measured by cluster precision and cluster recall.